Data selection and calibration issues in automatic language recognition - investigation with BUT-AGNITIO NIST LRE 2009 system

نویسندگان

  • Zdenek Jancik
  • Oldrich Plchot
  • Niko Brümmer
  • Lukás Burget
  • Ondrej Glembek
  • Valiantsina Hubeika
  • Martin Karafiát
  • Pavel Matejka
  • Tomas Mikolov
  • Albert Strasheim
  • Jan Cernocký
چکیده

This paper summarizes the BUT-AGNITIO system for NIST Language Recognition Evaluation 2009. The post-evaluation analysis aimed mainly at improving the quality of the data (fixing language label problems and detecting overlapping speakers in the training and development sets) and investigation of different compositions of the development set. The paper further investigates into JFA-based acoustic system and reports results for new SVM-PCA systems going beyond BUT-Agnitio original NIST LRE 2009 submission. All results are presented on evaluation data from NIST LRE 2009 task.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The L2F Language Verification System for NIST LRE 2009

This paper presents a description of the INESC-ID’s Spoken Language Systems Laboratory (LF) Language Verification system submitted to the 2009 NIST Language Recognition evaluation. The LF system is composed by the fusion of eight individual sub-systems: four phonotactic systems and four acoustic based methods. Language recognition results have been submitted for the “closed-set”, “open-set” and...

متن کامل

BUT language recognition system for NIST 2007 evaluations

This paper describes Brno University of Technology (BUT) system for 2007 NIST Language recognition (LRE) evaluation. The system is a fusion of 4 acoustic and 9 phonotactic subsystems. We have investigated several new topics such as discriminatively trained language models in phonotactic systems, and eigen-channel adaptation in model and feature domain in acoustic systems. We also point out the ...

متن کامل

Multilevel and channel-compensated language recognition: ATVS-UAM systems at NIST LRE 2009

This paper presents the systems submitted by ATVS – Biometric Recognition Group at 2009 language recognition evaluation, organized by the National Institute of Standards and Technology of United States (NIST LRE’09). Apart from the huge size of the databases involved, two main factors turn the evaluation into a very difficult task. First, the number of languages to be recognized was the biggest...

متن کامل

A Study of the Influence of Speech Type on Automatic Language Recognition Performance

Automatic language recognition on spontaneous speech has experienced a rapid development in the last few years. This development has been in part due to the competitive technological Language Recognition Evaluations (LRE) organized by the National Institute of Standards and Technology (NIST). Until now, the need to have clearly defined and consistent evaluations has kept some real-life applicat...

متن کامل

Fusing language information from diverse data sources for phonotactic language recognition

The baseline approach in building phonotactic language recognition systems is to characterize each language by a single phonotactic model generated from all the available languagespecific training data. When several data sources are available for a given target language, system performance can be improved using language source-dependent phonotactic models. In this case, the common practice is t...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010